A Neutral Semantic Representation for Data Model and Schema Translation
نویسندگان
چکیده
In order to achieve the interoperability of heterogeneous database systems, a semantics-preserving translation of the modeling constructs and constraints captured by different data models and defined in different schemata is a necessity. It is difficult to translate the constructs and constraints of one model directly into those of another model because 1) their terminologies and modeling constructs are often different, and 2) being high-level user-oriented models, their modeling constructs may have a lot of implied semantic properties which may or may not correspond to those of the others. If these high-level constructs and constraints are decomposed into some lowlevel, neutral, primitive semantic representations, then a system can be built to compare different sets of primitives in order to identify whether these constructs and constraints are identical, slightly different, or totally unrelated. Discrepancies among them can be explicitly specified by the primitive representations and be used in the application development to account for the missing semantics. In this paper, we present a neutral data model ORECOM which has been used in the development of a data model and schema translation system. The model is object-oriented and provides a small number of general structural constructs for representing the structural properties of high-level modeling constructs including those of object-oriented data models. It also provides a powerful knowledge rule specification language for defining those semantic properties not captured by the general structural constructs. The language is calculus-based and allows complex semantic constraints to be defined in terms of triggers and the alternative actions to be taken when some complex data conditions have or have not been satisfied. This paper also presents eight basic constraint types found in many semantically rich data models. These constraint types are represented in the neutral model by parameterized macros and their corresponding micro-rules. Parameterized macros are compact forms of neutral semantic representation which are used in an implemented system for comparing and translating the modeling constructs and constraints of different data models. The corresponding micro-rules are the detailed semantic descriptions of the constraint types. The translation of many modeling constructs and constraints found in several popular models into the neutral representation is illustrated by examples.
منابع مشابه
An Improved Semantic Schema Matching Approach
Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کامل1 an Object - Oriented Rule - Based Approach to Data Model and Schema Translation
To achieve data sharing among heterogeneous database management systems, one essential step is the conversion of the schemata defined in the diverse data models used by these systems. A semantic preserving translation of different modeling constructs and constraints is necessary to ensure that the semantics of applications remain intact after the translation. In this paper, we present an object...
متن کاملSchema Transformation for Semantic Geodata Translation
We address the problem of semantic geodata translation in a file-based data sharing environment. By Semantic translation, we mean the translation process that can resolve semantic heterogeneity between source and target data set. First we identify sources of semantic heterogeneity of spatial data. Then a classification of semantic heterogeneity using OO data model as the common framework is int...
متن کاملNeuron Mathematical Model Representation of Neural Tensor Network for RDF Knowledge Base Completion
In this paper, a state-of-the-art neuron mathematical model of neural tensor network (NTN) is proposed to RDF knowledge base completion problem. One of the difficulties with the parameter of the network is that representation of its neuron mathematical model is not possible. For this reason, a new representation of this network is suggested that solves this difficulty. In the representation, th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009